#speech recognition

[ follow ]
#speech-recognition
fromTechCrunch
1 week ago
Artificial intelligence

Speechify adds voice typing and voice assistant to its Chrome extension | TechCrunch

fromAxios
2 weeks ago
Artificial intelligence

AI's listening gap is fueling bias in jobs, schools and health care

fromInfoQ
4 months ago
Artificial intelligence

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

fromHackernoon
1 year ago
Artificial intelligence

SpeechVerse vs. SOTA: Multi-Task Speech Models in Real-World Benchmarks | HackerNoon

fromHackernoon
1 year ago
Artificial intelligence

Evaluating Multimodal Speech Models Across Diverse Audio Tasks | HackerNoon

fromTechCrunch
1 week ago
Artificial intelligence

Speechify adds voice typing and voice assistant to its Chrome extension | TechCrunch

fromAxios
2 weeks ago
Artificial intelligence

AI's listening gap is fueling bias in jobs, schools and health care

fromInfoQ
4 months ago
Artificial intelligence

Mistral Voxtral is an Open-Weights Competitor to OpenAI Whisper and Other ASR Tools

fromHackernoon
1 year ago
Artificial intelligence

SpeechVerse vs. SOTA: Multi-Task Speech Models in Real-World Benchmarks | HackerNoon

fromHackernoon
1 year ago
Artificial intelligence

Evaluating Multimodal Speech Models Across Diverse Audio Tasks | HackerNoon

fromTechzine Global
3 weeks ago

AI speech model aiOla Drax outpaces OpenAI & Alibaba

As explained in this video, flow-matching-based generative methods are a class of models that learn a "continuous vector field" in order to manage and transform what are relatively simple "noise distributions" into more complex data distributions. They do this by following ordinary differential equations. Instead of learning "discrete denoising steps" (that's what diffusion models do), they train the flow to match probability paths directly between data and noise.
Artificial intelligence
Startup companies
fromTechCrunch
1 month ago

Subtle Computing's voice isolation models help computers understand you in noisy environments | TechCrunch

Subtle Computing builds device-specific voice isolation models that preserve device acoustics to capture clean, personalized speech in noisy environments and outperform generic solutions.
Artificial intelligence
fromFast Company
1 month ago

Inside Microsoft's quest to make Windows 11's AI irresistible

Windows 11 introduces Copilot Voice to enable spoken interactions with AI and spoken responses, continuing decades of Microsoft voice-computing efforts.
fromSearch Engine Roundtable
1 month ago

Google Voice Search Now Using Speech-to-Retrieval (S2R)

At its core, S2R is a technology that directly interprets and retrieves information from a spoken query without the intermediate, and potentially flawed, step of having to create a perfect text transcript. It represents a fundamental architectural and philosophical shift in how machines process human speech.
Artificial intelligence
Artificial intelligence
fromFortune
2 months ago

I tried the viral AI 'Friend' necklace everyone's talking about-and it's like wearing your senile, anxious grandmother around your neck | Fortune

An always-listening AI necklace marketed for contextual emotional support failed to deliver reliable, timely, or truly contextual help during an emotional crisis.
fromClickUp
2 months ago

Voice Recognition vs Speech Recognition: What You Need to Know

You've probably used both technologies this week without realizing it. When Siri transcribes your text message, that's speech recognition. When your banking app verifies it's you speaking, that's voice recognition. The terms are often used interchangeably, but they address completely different problems. And as artificial intelligence gets better at faking human speech, understanding voice recognition vs. speech recognition becomes critical for anyone building secure systems.
Artificial intelligence
Gadgets
fromDesign Milk
3 months ago

Timekettle W4 AI Interpreter Earbuds Streamline Translation

Timekettle's W4 AI Interpreter Earbuds provide near-instant, AI-powered multilingual translation with Bone-voiceprint sensors, dual-voice pickup, noise filtering, and 98% accuracy across 42 languages.
#ai
Education
fromEntrepreneur
3 months ago

Use Rosetta Stone to Impress Clients Around the World with Fluent, Natural Speech | Entrepreneur

Lifetime Rosetta Stone access to 25 languages with speech-recognition and immersive lessons is available for new users for $148.97 using code FLUENT until September 7.
fromTheregister
3 months ago

Transcription app Otter.ai accused of illegal recordings

"Otter tries to shift responsibility, outsourcing its legal obligations to its accountholders, rather than seeking permission and consent from the individuals Otter records, as required by law."
Privacy professionals
fromClickUp
3 months ago

Whisper vs. Google Speech-to-Text: Which One Should You Use?

Whisper excels in multilingual transcription, supporting a variety of languages and offering consistent accuracy, making it suitable for global applications and media projects.
Artificial intelligence
#ai-technology
[ Load more ]